A Unified Framework for Managing Provenance Information in Translational Research
Abstract
Background: A critical aspect of the NIH Translational Research roadmap, which seeks to accelerate the delivery of “bench-side” discoveries to the patient’s “bedside,” is the management of provenance metadata, which tracks the origin and history of data resources as they traverse the path from the bench to the bedside and back. A comprehensive provenance framework is essential for researchers to verify the quality of data, reproduce scientific results published in peer-reviewed literature, validate scientific processes, and associate trust values with data and results. Traditional approaches to provenance management have focused on only parts of the translational research life cycle, and they do not incorporate “domain semantics”, which are essential to support domain-specific querying and analysis by scientists.
Results: We identify a common set of challenges in managing provenance information across the pre-publication and post-publication phases of data in the translational research life cycle. We define the semantic provenance framework (SPF), underpinned by the Provenir upper-level provenance ontology, to address these challenges in the four stages of provenance metadata:
(a) Provenance collection during data generation
(b) Provenance representation, to support interoperability and reasoning and to incorporate domain semantics
(c) Provenance storage and propagation, to allow efficient storage and seamless propagation of provenance as data is transferred across applications
(d) Provenance query, to support queries of increasing complexity over large data sets and to support knowledge discovery applications
We apply the SPF to two exemplar translational research projects, namely the Semantic Problem Solving Environment for Trypanosoma cruzi (T.cruzi SPSE) and the Biomedical Knowledge Repository (BKR) project, to demonstrate its effectiveness.
Conclusions: The SPF provides a unified framework to effectively manage the provenance of translational research data during the pre- and post-publication phases. This framework is underpinned by an upper-level provenance ontology called Provenir, which is extended to create domain-specific provenance ontologies that facilitate provenance interoperability, seamless propagation of provenance, automated querying, and analysis.
Similar resources
A Framework for Policies over Provenance
Provenance captures the history of a data item. This ensures the quality, the trustworthiness and the correctness of shared information, but the provenance may contain sensitive information so we may need to hide it. Sometimes we need access control policies to protect sensitive components and allow access based on certain properties. In other cases, we may need to share provenance but use reda...
GProM - A Swiss Army Knife for Your Provenance Needs
We present an overview of GProM, a generic provenance middleware for relational databases. The system supports diverse provenance and annotation management tasks through query instrumentation, i.e., compiling a declarative frontend language with provenance-specific features into the query language of a backend database system. In addition to introducing GProM, we also discuss research contribut...
Managing the Deluge of Scientific Data
Provenance information in eScience is metadata that's critical to effectively manage the exponentially increasing volumes of scientific data from industrial-scale experiment protocols. Semantic provenance, based on domain-specific provenance ontologies, lets software applications unambiguously interpret data in the correct context. The semantic provenance framework for eScience data comprises e...
Unified Platform for Secure Networked Information Systems
In this paper, we present a unified declarative platform for specifying, implementing, analyzing and auditing large-scale secure information systems. Our proposed system builds upon techniques from logic-based trust management systems, declarative networking, and data analysis via provenance. First, we propose the Secure Network Datalog (SeNDlog) language that unifies Binder, a logic-based lang...
The Role of Computer Anxiety in Acceptance of Iranian Public Library Management System Based on the Unified Theory of Acceptance and Use of Technology
Purpose: The main purpose of this study was to measure the acceptance and role of computer anxiety among the users of public libraries in Kerman province while using Iranian Public Library Management System (SAMAN) within the framework of the unified theory of acceptance and use of technology (UTAUT). Method: This is an applied study in terms of purpose and a descriptive study conducted using ...
Publication date: 2015